Likelihood decision boundary estimation between HMM pairs in speech recognition
Authors
Abstract
In maximum likelihood estimation of hidden Markov models for speech recognition, the criterion is to maximize the total probability across the training data for a particular speech unit, such as a word, monophone, diphone, or triphone. Since each unit model is trained separately, such a strategy can often lead to biases among decision boundaries of the generated model set. In this paper, we propose a new technique to minimize the total number of misclassifications in the training data set by adjusting the decision boundaries between hidden Markov model pairs. The proposed algorithm is shown to reduce the error rate in a number of speech recognition tasks such as accent detection, language identification, and confusable word pair discrimination. The technique is also attractive because it is simple to implement and the improvement in performance is achieved without any added complexity in the decoding phase.
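The idea of shifting a pairwise decision boundary to reduce training misclassifications can be illustrated with a minimal sketch. This is not the paper's algorithm, which the abstract does not spell out; it assumes the boundary between two models A and B is a single scalar offset applied to the log-likelihood difference log P(x|A) - log P(x|B), and it searches for the offset with the fewest training errors. The function names `count_errors` and `best_boundary_offset` are hypothetical.

```python
from typing import List, Tuple


def count_errors(diff_a: List[float], diff_b: List[float], offset: float) -> int:
    """Count training misclassifications for the rule 'choose A if diff > offset'.

    diff_a: log P(x|A) - log P(x|B) for training tokens whose true class is A.
    diff_b: the same difference for tokens whose true class is B.
    """
    missed_a = sum(1 for d in diff_a if d <= offset)  # A tokens labelled B
    missed_b = sum(1 for d in diff_b if d > offset)   # B tokens labelled A
    return missed_a + missed_b


def best_boundary_offset(diff_a: List[float], diff_b: List[float]) -> Tuple[float, int]:
    """Return (offset, errors) minimising training errors for one model pair.

    Assumption: the boundary is a single scalar offset. An offset of 0.0
    corresponds to the plain maximum likelihood decision.
    """
    values = sorted(set(diff_a) | set(diff_b))
    if not values:
        raise ValueError("need at least one training token")
    # The error count only changes when the offset crosses a data point, so
    # midpoints between neighbouring values (plus one point beyond each end
    # and the unshifted boundary 0.0) cover every distinct outcome.
    candidates = [values[0] - 1.0, values[-1] + 1.0, 0.0]
    candidates += [(lo + hi) / 2.0 for lo, hi in zip(values, values[1:])]
    # Among equally good offsets, prefer the one closest to the ML boundary.
    best = min(candidates, key=lambda t: (count_errors(diff_a, diff_b, t), abs(t)))
    return best, count_errors(diff_a, diff_b, best)
```

In this sketch the offset is learned once from training data and then simply added to the score comparison at recognition time, which is consistent with the abstract's remark that decoding complexity does not increase.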
Similar resources
Feature dimension reduction using reduced-rank maximum likelihood estimation for hidden Markov models
This paper presents a new method of feature dimension reduction in hidden Markov modeling (HMM) for speech recognition. The key idea is to apply reduced-rank maximum likelihood estimation in the M-step of the usual Baum-Welch algorithm for estimating HMM parameters, such that the estimates of the Gaussian distribution parameters are restricted to a subspace of reduced dimensionality. There are ...
Dictionary-based discriminative HMM parameter estimation for continuous speech recognition systems
The estimation of the HMM parameters has always been a major issue in the design of speech recognition systems. Discriminative objectives like Maximum Mutual Information (MMI) or Minimum Classification Error (MCE) have proved to be superior over the common Maximum Likelihood Estimation (MLE) in cases where a robust estimation of the probabilistic density functions (pdfs) is not possible. The det...
An improved training algorithm in HMM-based speech recognition
In HMM-based speech recognition, estimation of the parameters of HMMs is viewed as the counterpart of training or learning in traditional sequential pattern recognition, since a speech signal can be represented by a sequence of n-dimensional vectors after features are extracted from it. However, due to variation of duration of the phone with speakers and context and its randomness, speech samp...
Predictive hidden Markov model selection for decision tree state tying
This paper presents a novel predictive information criterion (PIC) for hidden Markov model (HMM) selection. The PIC is exploited to select the best HMMs, which provide the largest prediction information for generalization of future data. When the randomness of HMM parameters is expressed by a product of conjugate prior densities, the prediction information is derived without integral ...
Maximum likelihood noise HMM estimation in model-based robust speech recognition
This paper presents a generalization of Rose's Integrated Parametric Model to the Gaussian mixture hidden Markov model (HMM) formulation. Observations from clean speech HMM and noise HMM models are combined in the log spectra domain, through a corruption function, to generate noisy speech observations. In order to recognize noisy speech with the proposed model, when only the clean speech HMM a...
Journal title:
- IEEE Trans. Speech and Audio Processing
Volume 6, Issue
Pages -
Publication date: 1998